Genome-Wide Dissection,
Characterization, and Expression Profiling of Cotton GASA Genes Reveal their Importance
in Regulating Abiotic Stresses
Muhammad Shaban1, Aamir Hamid Khan2, Etrat Noor1, Abdul Qayyum1,
Waqas Malik1, Muhammad Shehzad3, Umar Akram4,
Ayesha Razzaq1 and Muhammad Adnan Tabassum5
1Genomics Lab,
Department of Plant Breeding and Genetics, Bahauddin Zakariya University,
Multan, Pakistan
2National
Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University,
Wuhan, Hubei 430070, P.R. China
3State Key Laboratory of Cotton Biology, Institute of
Cotton Research, Chinese Academy of Agricultural Sciences, Anyang 455000,
Henan, P.R. China
4Biotechnology
Research Institute, Chinese Academy of Agricultural Sciences, Beijing, P.R. China
5Joint International Research Laboratory of Agriculture and
Agri-Product Safety, The Ministry of Education of China, Institutes of Agricultural
Science and Technology Development, Yangzhou University, Jiangsu, China
*For correspondence: raoqayyim@bzu.edu.pk, drshabaan@outlook.com
Received 08 August 2020; Accepted 21 September 2020;
Published 10 December 2020
Abstract
Short amino acids constituting proteins of gibberellic
acid stimulated Arabidopsis (GASA) gene family are widely implicated in plant
growth, development and have potential in mitigating environmental stresses.
There is limited information about functions of these genes in cotton. In the
present study, a total of 116 GASA genes were screened from four species of
cotton. During phylogenetic clustering, these genes were distributed into three
groups based on their homology. Duplication analysis among three species of
cotton revealed that segmental duplication events might be the possible reason
for expansion and domestication of cultivated tetraploid cotton. Further,
chromosomal distributions of GASA genes on cotton chromosomes were found
uneven. The genes structure and motifs division pattern of GhGASA genes within same group was relatively conserved. Promoter
regions analysis of GhGASA genes
comprehend their involvement in a variety of plant mechanisms related to growth
and survival against environmental stresses. In tissue-specific expression
analysis, significantly higher induction of GhGASA
genes in various tissues of upland cotton revealed their importance in
development of these tissues. Additionally, differential expressions of GhGASA genes to multiple abiotic
stresses, especially against salt and cold stresses predicted their potential
roles in regulating these environmental cues. In conclusion, this is the first
comprehensive study regarding identification and investigation of cotton GASA
gene family. Data presented here provide important information for future
elucidating and characterizing potential target GASA genes related to abiotic
stress resistance in cotton. © 2021 Friends Science Publishers
Keywords: GASA genes; Cotton; Genome-wide; Tissue
expression; Stress responses
Introduction
Cysteine-rich peptide (CRP) is a large group of proteins
including thionines, lipid transfer proteins, defensins and GASA/Snakin. Each
CRP class can be distinguished from other based on the orientation and number
of cysteine residues in the primary sequence (Oliveira-Lima
et al. 2017). Recently, CRP proteins have been reported
extensively for their functions in plant development and manipulation of
environmental stresses (Balaji and Smart 2012; Haruta
et al. 2014; Ahmad et al. 2019; Li et al. 2019; Ahmad et
al. 2020). CRP proteins constitute a large gene family and widely
distributed in plants. The GASA (Gibberellic-acid stimulated Arabidopsis) a
subfamily of CRP, is short amino acids, low molecular weights, mostly
gibberellin regulated and is widely distributed in plants. It is comprised of
three distinct domains, 18–29 residues comprising N-terminal domain, a
C-terminal domain (called GASA domain) consisting of 12 conserved cysteine
residues and an intermediate highly divergent region between C and N terminals (Herzog et al. 1995). The fact that
number and position of C-terminal conserved residues remained same throughout
the evolution in different species might suggests their key role in defining
functions of this gene family (Ben-Nissan et
al. 2004). Previous studies have reported that C-terminal GASA
domain is essential for determining antioxidant activity (Wigoda et al. 2006; Nahirnak et al.
2012b) and formation of disulfide bond during protein folding (Porto and Franco 2013).
GASA gene family plays important
roles during different processes in plant life from seed germination to
maturity. Studies based on expression pattern analysis suggest that GASA genes
have specific spatial and temporal expression pattern and most of these are
expressed in young tissues and actively growing organs (Peng et al. 2010; Nahirnak et al. 2016). These are
not only the target genes responsible for specific functions but also act as
regulatory proteins to monitor plants signaling for growth and stress responses
(Ceserani et al. 2009; Zhang et al.
2009; Sun et al. 2013). Further, these genes control plant
hormonal level and hormonal signaling network to fine tune different
physiological processes in plants (Wang et al.
2009; Rubinovich et al. 2014). Additionally, transgenic studies,
homology and expression analysis indicate substantial involvement of GASA genes
in plant developmental processes by affecting various cellular processes (Shi et al. 1992; Aubert et al. 1998; Kotilainen
et al. 1999; Fuente et al. 2006; Furukawa et al. 2006;
Nahirnak et al. 2012a). More importantly, GASA family genes have
preferential roles in floral development and regulation of floral timing (Muhammad et al. 2019). Further, GASA
genes also modulate cell elongation (Ben-Nissan
and Weiss 1996), root growth (Zimmermann et
al. 2010) and stem development (Zhang
et al. 2009) and fruits ripening (Moyano-Canete
et al. 2013) in plants. Interestingly, some GASA family members
have opposite functions related to flowering in plants, such as AtGASA4 stimulates flowering (Roxrud et al. 2007) while AtGASA5 inhibits flowering in
Arabidopsis (Zhang et al. 2009).
In addition to plant growth and
developmental functions of GASA genes, some members of this family are also
involved in regulating plants stress responses. For example, increased
anti-bacterial and anti-fungal activities were correlated with potato Snakin-1
and Snakin-2 proteins during in vivo
experiments (Segura et al. 1999; Berrocal-Lobo
et al. 2002; Almasia et al. 2008; Kovalskaya and Hammond 2009; Balaji
and Smart 2012). Similarly, CaSn
protein enhanced pepper resistance against root-knot nematode (Mao et al. 2011). In Arabidopsis, AtGASA4 and AtGASA5
substantially enhanced tolerance to heat stress by affecting BiP gene
expression and regulating SA signaling pathway, respectively (Ko et al. 2007; Zhang and Wang 2011).
Further, constitutive expression of AtGASA14
increased tolerance to salinity by restricting accumulation of reactive oxygen
species (ROS) (Sun et al. 2013).
Likewise, overexpression of FsGASA4
improves plant tolerance to abiotic stresses by enhancing SA level and induced
expressions of SA signaling pathway genes (Alonso-Ramirez
et al. 2009).
Gibberellic Acid-Stimulated
Transcript 1 (GAST1) was the first GASA gene isolated and characterized in tomato during investigation of
GA-deficient gib mutant (Shi
et al. 1992). Later, with the advancement of sequencing
technologies, more members of GASA genes were reported in diverse plant species
including Arabidopsis thaliana (Herzog et al. 1995) wheat (Zhang et al. 2017), rice (Furukawa et al. 2006), maize (Zimmermann et al. 2010), apple (Fan et al. 2017), petunia (Ben-Nissan and Weiss 1996), potato (Nahirnak et al. 2016), soybean (Ahmad et al. 2019) and grapevine (Ahmad et al. 2020). GASA family
constitutes a large number of genes in some species, for example, in soybean 37
GASA/Snakin genes were identified (Ahmad et
al. 2019), similarly 16 in potato (Nahirnak
et al. 2016), 15 in Arabidopsis (Fan
et al. 2017), 14 in grapevine have been reported (Ahmad et al. 2020). The diversity and
number of GASA/Snakin genes identified in remotely related species depicts
their significance and suggest their essential roles in life of plants.
Cotton (G. hirsutum) is a
naturally fiber and oil producing crop of huge importance for textile and oil
industry of the world. The economy of many developing countries depends on the
sustainable production of cotton. As cotton is a mesophytic plant, its growth
and yield are severally affected by both biotic and abiotic stresses (Dabbert and Gore 2014). Based on the
significance of GASA genes in regulating growth, development and responses to
different environmental stresses in multiple plant species, GASA family was
selected for systematic and comprehensive analysis in cotton. In this study, a
total 116 putative GASA genes were screened from four species of cotton. Their
phylogeny, synteny, motifs, structural features, cis-elements, spatiotemporal expressions in various tissues and
under abiotic stresses (cold, heat, salinity) was investigated in details. Our
results laid a foundation for further characterization of GASA family related
to growth, development and responses to different environmental stresses.
Materials and
Methods
Identification,
sequence analysis and properties of GASA gene family in Gossypium spp.
All the reported Arabidopsis GASA family genes were
downloaded (Fan et al. 2017).
Afterwards, BLASTP programme (Zhu et al.
2017) with e-value 1e-10 was run online to find candidate
GASA genes from four species of cotton including G. hirsutum, G. barbadense,
G. arboreum and G. raimondii using 15 Arabidopsis GASA
genes as query. Subsequently, manual and online databases including NCBI
conserved domain (https://ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) (Marchler-Bauer et al. 2015) Pfam (http://pfam.xfam.org/)
(El-Gebali et al. 2019), and SMART
(http://smart.embl-heidelberg.de/) (Letunic et
al. 2015) were accessed to confirm GASA domain of Pfam (PF02704)
from all the members of GASA family. Further, physio-chemical properties of
GASA genes including theoretical molecular weight, protein length and
isoelectric point were determined through ExPASy web tool
(http://web.expasy.org/) (Bjellqvist et al.
1994).
Chromosomal
mapping, synteny and duplication analysis
Chromosomal positions of GASA genes were
plotted on cotton chromosomes using CIRCOS software. Based on the gene
position, the distribution of each GASA gene was analyzed. Further, homologous
genes among G. hirsutum, G. arboreum and G. raimondii were
screened using BLASTP program with similarity > 80% and alignment percentage
> 80% compared to total length of proteins (Yang
et al. 2008). The sequences of orthologous genes were aligned
using Clustal Omega (http://www.ebi.ac.uk/Tools/msa/clusalo/). Subsequently,
aligned sequences were submitted to PAL2NAL software (http://www.bork.embl.de/pal2nal/)
(Suyama et al. 2006) for determination
of synonymous (Ks), non-synonymous (Ka) substitution rates and their ratios
among duplicated gene pairs. Separately, GhGASA
genes were also mapped on chromosomes using Mapchart software (Version 2.0) (Voorrips 2002).
Phylogenetic tree
construction
The phylogenetic tree between GhGASA, GbGASA, GaGASA and GrGASA and Arabidopsis GASA genes was constructed using their protein
sequences. Firstly, multiple sequence alignment of protein sequences was
generated using ClustalW software (Thompson et
al. 2002). Later, this alignment file was submitted in MEGA 6.0
software (Tamura et al. 2013) for
generation of phylogenetic tree. Neighbor-joining (NJ) method was adopted with
parameters as follows, bootstrap values: 1000 replicate, pairwise deletion,
Poisson correction and uniform rates.
Gene structure,
motifs prediction and cis-elements
analysis
Coding DNA sequence and corresponding Genomic sequence
of each GhGASA gene was applied in
online tool Gene Structure Display Server (GSDS, V.2) (http://gsds.cbi.pku.edu.cn/)
(Hu et al. 2015) to construct gene
model showing organization and number of exon-introns. The conserved motifs in GhGASA genes were predicted using MEME
suits (Multiple Expectation Maximization for Motif Elicitation) (Bailey et al. 2015). Further,
annotations of these motifs were obtained from Pfam database (http://pfam.xfam.org/)
(El-Gebali et al. 2019). 1.5 kb
upstream 5’ flanking region of each GhGASA
gene was retrieved from CottonFGD website (https://cottonfgd.org/analyze/) (Zhu et al. 2017) and subsequently subjected to
Plant-CARE program (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/)
(Lescot et al. 2002) for
prediction of potential cis-acting
elements.
Abiotic stress
treatments
Seeds of upland cotton (G. hirsutum cv YZ1) were
soaked in water and put at 28°C for one night. Next day drained out the excess
moisture and put back at 28°C in incubators to induce germination, seeds with
good growth potential were selected and planted into soil pots filled with
commercially available sterilized composted soil under controlled conditions in
growth chamber. The temperature, humidity and photoperiod cycle was set as 28°C,
60%, 16 h light/8 h dark period, respectively. Uniform cotton seedlings at 3–4
leaf stage were selected and subsequently subjected to salt (200 mmol/L NaCl)
and heat stress (38°C) for 48 h. Alternatively, control seedlings were watered
normally and used as mock. Leaf samples were collected after 72 h from both
control and salt treated plants, immediately kept in liquid nitrogen and stored
at -80°C for subsequent RNA extraction.
Expression pattern
analysis
The plant total RNA from all samples was extracted using
the method as reported earlier by Tu et al.
(2007). After dilution,
RNA was reverse transcribed into cDNA using the reverse transcription kit
(Promega, U.S.A.). qRT-PCR was run as reported earlier by Xu et al. (2014) with thermal cycles
was as follows, 95°C for 15 s, 60°C for 1 min and 95°C for 15 s. The 2-∆∆Ct
method was adopted to compare the gene expression values and GhUBQ7 (DQ116441) gene (Xu et al. 2014) served as an internal
control. Primers used for expression analysis are enlisted in Table S10.
For expression pattern analysis
of GhGASA genes, publically available
transcriptomic data related to different tissues (roots, stems, leaves, petals,
ovules, seeds) and under multiple stresses including cold, heat, PEG and
salinity were retrieved from Cotton FGD (https://cottonfgd.org/) (Zhu et al. 2017). Afterwards, Genesis
software (Version 1.0) was used to normalize expression values and generation
of heatmap (Sturn et al. 2002).
Results
Identification and
characterization of GASA genes in Gossypium
spp.
The genome-wide identification of GASA gene family in
four species of cotton (G. hirsutum, G. barbadense, G. arboreum
and G. raimondii) was carried out through using the Arabidopsis GASA domain as query in BLASTP search against the
corresponding genome of four cotton species. As a result, total 116 GASA genes
were obtained from the studied four Gossypium
species (Table S1). The numbers of putative GASA genes were as 40 in G. hirsutum, 32 in G. barbadense, 22 in G. arboreum
and 22 in G. raimondii. These identified genes were further manually checked
using NCBI conserved
domain, SMART and
Pfam databases. All the genes possessed the putative GASA domain as reported in
previous studies (Fan et al. 2017; Zhang
and Wang 2017; Ahmad et al. 2020). Further, these genes were
named based on their chromosomal locations. Multiple sequence alignment showed
that all these GASA family genes harbored 12 conserved cysteine residues, a
characteristic motif of GASA gene family (Fig. S2). This motif is responsible
for maintaining stability and structure and of GASA proteins (Betz 1993; Darby and Creighton 1995).
Additionally, protein properties including length, molecular weights and
isoelectric points of all the predicted GASA family members were analyzed using
ExPASy program (Table S2).
A circos plot was created between
two diploid species (G. arboreum, G. raimondii) and
subgenomes of allotetraploid specie (G.
hirsutum) to visualize the
chromosomal distribution and syntenic relationship of GASA family genes. As
shown in Fig. 1, all the GASA genes of studied species are evenly distributed
on chromosomes. As GaGASA, GrGASA and GhGASA genes were found to be
distributed on 10 chromosomes of A, D, and At and Dt subgenomes,
respectively (Table S3). The distribution was uneven, with chromosomes A05 and
A09 harbored 6 and 4 GhGASA genes
while chromosomes A06, D05, D07, D09 and A07, D03, D04, D06 possess 3 and 2 GhGASA genes, respectively. Other
chromosomes A02, A03, A04, A10, A11, A12, D02, D10, D11, and D12 contained
single GhGASA gene (Fig. S1).
Additionally, 40 and 42 duplicated gene pairs were identified from At to A2 and
Dt to D5, respectively and members of each duplicated gene pairs exhibited
great similarity to each other (Table S4).
Additional duplication analysis
among GhGASA genes found 14
duplicated gene pairs and all of these pair experienced segmental duplications
(Table S5). Further, All the duplicated GhGASA
genes experienced purifying selection except two duplicated genes pairs (GhGASA10-GhGASA31 and GhGASA20-GhGASA39) that undergoes positive
selection (Table S5).
Phylogenetic
clustering and Structural properties of GASA genes in cotton
For estimation of phylogenetic relationship among
members of GASA family, a phylogenetic tree was constructed using amino acids
sequences of model plant Arabidopsis and four cotton species. According to
phylogenetic tree, GASA genes of cotton and Arabidopsis were clustered into
three groups (G1, G2 and G3) based on their homology and protein structures
(Fig. 2). Group 3 contained highest number of GASA family members 51, while
group 1 and 2 possess 41 and 39 GASA genes, respectively.
The evolutionary relationship
based on genes structural diversity was considered an important component in
the study of multigene families. To study the structural similarity or
differences among putative GhGASA
genes, an exon-intron map was constructed (Fig. 3A). Results revealed variation
in number of exons among GhGASA genes
that fluctuated from 2 to 5, with highest number of introns and exons was found
in GhGASA17 (introns: 4, exons: 5).
Expectedly, the number and composition of exon-intron between closely related
members was same within the same group. More variations in number of exons and
introns were observed among group 3 members. All members of group 2 contained
one intron and 2 exons, while all members of group 1 include 3 introns and 4
exons, with the exceptions of GhGASA19
and GhGASA38 that contained 2 introns
and 3 exons (Fig. 3). To get some perceptible information about paralogous GhGASA genes in phylogenetic tree, we
analyzed their exon-intron structures. Interestingly, most of the paralogous GhGASA genes harbored same number and
orientation of exons-introns. For example, GhGASA1/30 possessed three exons and two introns
but some variations were observed in intron length. Probably, these relations
have formed the size and structures of putative GASA genes in cotton.
To further explore the
structures and features of GhGASA
gene family, MEME server was accessed for finding distinctive or similar motifs
in GhGASA genes. Ten distinct motifs
of different length were found among 40 GhGASA
genes (Fig. 3B) and annotated using SMART and Pfam servers. Only motif 1 was
found to be the representative of GASA domain, while the functions of other
motifs were unknown (Table S6). Normally, most of the closely related members
harbored similar motifs within the same group, proposing their similar function.
For example, GhGASA7/12, GhGASA14/34, GhGASA16/36, GhGASA9/25
and GhGASA3/26 shared similar motifs.
In detail investigation found that only motif 3 was present in all GhGASA genes, while motifs 4, 6, 7, 8
were absent in all members of group 1 and 2. Moreover, some motifs were found
to be specific to special GhGASA
members, such as motif 10 was only found in GhGASA1
and GhGASA24. However, it is not
known whether these motifs confer unique functions to these GhGASA genes or not.
Promoter analysis
of GhGASA genes
Cis-elements are considered to
participate in controlling expression of genes. To explore the probable
association of these cis-elements
with the expression or functions of GhGASA
genes, 1.5 kb promoter region of each GhGASA
genes was extracted and analyzed using PLANTCARE website. A diversity of cis-elements responsive to growth,
development, stress, light and phytohormones were identified. Among 40 GhGASA genes, light responsive elements
were predominant (57%), followed by hormones responsive (19%), growth and
development related (14%) and environmental stress responsive (10%) (Fig. 4A
and Table S7).
Fig. 1:
Synteny analysis of GASA gene family
Syntenic association between two
diploid species (G. arboreum, G. raimondii) and one
tetraploid specie (G. hirsutum) was generated through circos
program. Different chromosomes of selected species are marked with different
colors
Fig. 2: Phylogenetic
relationship of GASA genes from Arabidopsis
and Gossypium species
Phylogenetic tree was constructed by MEGA 6.0 software using
neighbor-joining (NJ) method with 1000 replicates. The GASA genes from G. hirsutum,
G. raimondii, G. arboreum, G. barbadense and Arabidopsis were marked with different
colors and shapes. Group1, Group 2 and Group 3 were indicated in green, red and
blue colors, respectively
Fig. 4:
Promoter region analysis of GhGASA
genes
A. Percentage of cis-acting elements in the promoter regions of GhGASA genes, different colors corresponds to percentage of
specific cis-elements category. B.
presence of cis-elements in promoter
region of each GhGASA gene was shown
in column and in different colors
The pattern of cis-elements
varied among GhGASA genes.
Surprisingly, GhGASA11 harbored only
one site for auxin from hormones responsive elements but possess highest number
of low temperature responsive cis-elements.
Moreover, GhGASA9 possessed highest
number of gibberellin responsive sites among other GhGASA genes (Fig. 4B and Table S7).
Differential tissue-specific
expression of GhGASA genes in upland
cotton
To investigate the expression pattern of GhGASA genes in different tissues of cotton, a heatmap was generated.
Results revealed that all the GhGASA
genes were expressed in at least one tissue, except GhGASA19 and GhGASA29
(Fig. 5 and Table S8). However, three genes (GhGASA17, GhGASA20, and GhGASA35) expressed in all the seven
tissues at all the time points [Fragments per kilobase of transcript per
million mapped reads (FPKM ≥1)]. Moreover, four genes (GhGASA10, GhGASA17, GhGASA20, and GhGASA35)
highly expressed in roots (FPKM ≥20) and six
Fig. 5:
Expression patterns of GhGASA genes
in various tissues of upland cotton.
The RNA-seq data related to organ-specific expression were accessed from
CottonFGD (https://cottonfgd.org/) and Genesis software package was
used for generation of heatmaps. DPA (days post anthesis), SW0 (sowing in water
0 h), SW5 (sowing in water 5 h)
genes (GhGASA4,
GhGASA9, GhGASA27, GhGASA28, GhGASA30 and GhGASA32) were found to be dominantly induced in leaves (FPKM
≥5). While, four genes (GhGASA13,
GhGASA17, GhGASA31 and GhGASA39)
were highly induced in petals (FPKM ≥51) and five genes (GhGASA6, GhGASA8, GhGASA14, GhGASA24, and GhGASA34 in ovules (FPKM ≥2.7). Expression of some genes
found to be specific to single tissue, such as GhGASA11& GhGASA12
only expressed in fiber (20 DPA).
Differential
expression of GhGASA genes under
multiple abiotic stresses
Considering important functions of GASA genes against
various environmental stresses in a number of plant species, firstly, we thoroughly
investigated the expression pattern of GhGASA
genes using published transcriptomic data of cotton treated with heat, cold,
polyethylene glycol (PEG) and salt (NaCl). From the results, it was observed
that all GhGASA genes showed altered
expression under one or more stress conditions except five GhGASA genes (GhGASA5, GhGASA19, GhGASA23, GhGASA29, GhGASA40) that were not expressed under
any stress condition (Fig. 6 and Table S9). Comparing four treatments, more
numbers of genes were differentially expressed against salt and cold stresses
as compared with heat and PEG. In response to cold stress, seven GhGASA genes highly induced during 3 h
time period and GhGASA6/16 had the
highest expression. Under heat stress, five genes showed increased expression
at 12 h time period, with GhGASA7 had
the highest transcript abundance. Similarly, under salt stress seven GhGASA genes were highly expressed at 12
h time period (treatment RPKM/control RPKM ≥ 2.5) with highest expression
noted for GhGASA26. Interestingly,
some genes only expressed under specific stress condition, such as GhGASA7/21 and GhGASA6/16 only expressed in response to heat (12 h) and cold (3 h)
(treatment RPKM/control RPKM ≥ 3). This specific expression pattern might
support their involvement in modulating these stress responses in cotton. Secondly,
to validate expression profile of GhGASA
genes obtained through transcriptomic data, we choose eleven GhGASA genes based on their higher
expression under salt or heat stress for further analysis through qRT-PCR.
Similar to transcriptomic profile of GhGASA
genes, most of the selected genes were up-regulated under salt stress as
compared to heat stress. Four GhGASA
genes including GhGASA18, GhGASA22, GhGASA36, and GhGASA37
were highly expressed under salt stress. Expression of three genes including GhGASA20, GhGASA33, and GhGASA39 was more under heat stress as
compared to salt stress. However, three genes GhGAS24, GhGASA30 and GhGASA32 were down-regulated both under
salt and heat stress.
Discussion
Earlier studies comprehensively showed the roles of GASA
genes in monitoring plant developmental processes and responses to various
stress responses in various plant species (Nahirnak
et al. 2012b; Zhang and Wang 2017). However, previously it was
not focused to identify and characterize GASA genes in cotton. In this study, a
total of 116 putative GASA genes were identified from four species of cotton.
Multiple sequence alignment of all these genes highlighted the existence of 12
amino acids conserved cysteine residues, a motif considered essential for
maintaining structure and functions of GASA genes in different plant species (Betz 1993; Darby and Creighton 1995; Silverstein et
al. 2007; Muhammad et al. 2019). In G. hirsutum, 40 GASA
genes were identified, which is relatively a larger gene family among other
three species of cotton (G. barbadense, G. arboreum and G. raimondii)
and other reported plant species (Fan et al.
2017; Ahmad et al. 2019). Interestingly, all the GaGASA, GrGASA, and GhGASA are
evenly distributed on 10 chromosomes of A, D and At and Dt subgenomes,
respectively.
Gene duplication is an important
mean for functional diversification, evolution and expansion of gene family (Kong et al. 2007). There are three ways
through these duplication events takes in plants. Segmental duplications
prevails when distribution of duplicated gene pairs are on different
chromosomes, while tandem duplication occurs when duplicated genes are located
on the same chromosomes. Duplication analysis among GhGASA genes revealed that segmental duplication is the leading
cause for expansion of GhGASA gene
family. Further, synonymous (Ks), non-synonymous (Ka) substitution rates and
their ratio were computed to investigate the duplication mechanism of GhGASA genes after being diverged from
their ancestor. The Ka/Ks ratio equal to 1 shows neutral selection, while
greater than 1 and less than 1 indicates positive and purifying selection,
respectively (Hurst et al. 2002).
Interestingly, in our study most of the duplicated GhGASA genes experienced purifying selection mechanism. Additional
systemic analysis is needed to explore further insights in the evolution of
cotton GASA gene family.
Fig. 6:
Expression pattern of GhGASA genes
against different abiotic stresses in upland cotton
A. Transcriptomic data related to heat, cold, PEG and
drought were accessed from CottonFGD website (https://cottonfgd.org/),
normalization and visualization of these data were performed using Genesis
software package. B. Expression pattern of selected GhGASA genes under salt and heat stress by qRT-PCR. Columns
represent the mean of three biological repeats and error bars represents the
standard deviation
In phylogenetic tree, all the
GASA genes were distributed in three groups (G1, G2 and G3) based on their
homology and structures. G3 contained more number of GASA genes than others.
Similar kind of grouping among GASA family genes were also observed in other
reported plant species (Zimmermann et al.
2010). Gene structure analysis showed that most of GhGASA genes within specific group harbored similar exons-introns
number and organization corresponding to their conserved functionality.
However, some variations were observed among group 3 members that indicate
their functional diversity. In terms of exons-introns numbers, G2 was found to
be more conserved than others, suggesting that other groups might gain or lost
exons during evolutionary process leading to differences in structures. Similar
trend of exons-introns conservation among G2 members was also noted in previous
study (Ahmad et al. 2020). To
further explore GhGASA genes in
detail, we analyzed their motifs distribution. Expectedly, most of closely
related GhGASA genes with in specific
groups constitute similar motifs. Moreover, some specific motifs were found to
be associated with some special GhGASA
genes. However, it is not known whether these motifs confer unique functions to
these GhGASA genes or not. In any
case, conservation of these motifs within and between groups further supports
the results of phylogenetic tree. Cis-elements
present in gene promoter normally regulate gene expression and confers unique
functionality to genes (Biłas et al.
2016). To study important roles of GhGASA
genes under changing environmental conditions and various physiological
processes of plants, we thoroughly investigated the promoter regions of each GhGASA gene. Mainly four types of
elements were observed including growth, development-related, light, hormones
and environmental stress-responsive elements. The diversity of cis-elements in GhGASA genes agree with the reported multidimensional functions of
GASA genes in different plants (Oliveira-Lima et
al. 2017; An et al. 2018; Li et al. 2019; Muhammad et al.
2019).
Expression pattern of genes
provide useful clues for their functional characterization. Previously, it has
been noted that GASA genes have spatiotemporal specificity in various plant
species which might be due their probable involvement in diverse hormone
signaling pathways (Wang et al. 2009;
Zhang and Wang 2017). In present study, expression analysis of GhGASA genes in various tissues of
cotton showed that maximum number of GhGASA
genes induced in roots and ovules, signifying their important roles in
development of these tissues. Moreover, considering the transcript abundance of
GhGASA genes in cotton and the
published role of their orthologs in Arabidopsis
helped us to predict their probable functions. For example, GhGASA35 along with higher expression in
roots also strongly induced in seeds (SW5) (Fig. 5; Table S9) and its artholog AtGASA4 in Arabidopsis regulates seed germination (Roxrud et al. 2007; Rubinovich and Weiss 2010). Similarly, GhGASA2 did not expressed in any tissue
except seed (SW0) and its orthologous gene AtGASA10
reported to have important roles in seed germinations (Trapalis et al. 2017). This suggests that might be GhGASA35 and GhGASA2 have related roles in cotton as AtGASA4 and AtGASA10 in Arabidopsis. Nevertheless, this trend
was not consistent to all GhGASA
genes, signifying that these GASA genes might be undergone functional
diversification across species.
The quality and yield of cotton
is substantially affected by multiple abiotic stresses during the development
of plant (Hassan et al. 2020).
Therefore, we comprehensively studied the transcript abundance of GhGASA genes under several environmental
cues including heat, cold, PEG and salt stress using published transcriptomic
data. The results showed that most of GhGASA
genes changed their expression under one or more stress conditions.
Additionally, more numbers of GhGASA
genes were highly induced under cold and salt stresses as compared with heat
and PEG stresses, suggesting their probable function in monitoring these
stresses in cotton. Further, qRT-PCR analysis of selected GhGASA genes under salt and heat stress supported the results of
transcriptomic profile. Moreover, by comparing GhGASA genes in response to stress conditions and the existence of
relevant cis-elements in promoter
regions of those genes further support their effectiveness against these
stresses. For example, GhGASA26 was
highly induced under salt stress (12 h) and contain maximum number of stress
responsive elements in its promoter region. Interestingly, some GhGASA genes only induced under specific
stress treatment, such as GhGASA7/21
and GhGASA6/16 only expressed in
response to heat (12 h) and cold (3 h) (treatment RPKM/control RPKM ≥ 3).
This specific expression of selective genes might further support their
importance in regulating these stress responses.
Conclusion
In silico analysis of cotton genomes enabled us to investigate GASA
family genes in details. By adopting systematic approach consisting of
conserved domain analysis, chromosomal distribution patterns, synteny,
phylogeny, exons-introns organization and motifs division analysis,
comprehensive characteristic features of GASA genes in cotton were elucidated.
Promoter region analysis of GhGASA
genes supported their involvement in a variety of biological functions in
plants. Moreover, spatiotemporal tissue specific expression of GhGASA in upland cotton showed that most
of GhGASA genes were induced in
leaves and ovules than other. Additionally, qRT-PCR and transcriptomic
expression profiles of GhGASA genes
under various abiotic stress factors suggested that most of GhGASA genes have the capacity to
regulate tolerance against multiple abiotic stresses. In short, the present
genome-wide investigation of GhGASA
gene family provides potential information for future more focused studies
regarding in-depth characterization of GASA genes in cotton.
Acknowledgement
This study is a part of Interim Placement of Fresh Ph.D.
program granted by Higher Education Commission of Pakistan.
Author Contributions
MS designed and wrote manuscript. AK, EN, MS, UA, AR
helped in performing experiments and analyzing data. WM and MT helped in
revising manuscript. AQ gave suggestions to improve the experimental work.
References
Ahmad B, J Yao, S
Zhang, X Li, X Zhang, V Yadav, X Wang (2020). Genome-wide characterization and
expression profiling of GASA genes during different stages of seed development
in Grapevine (Vitis vinifera L.)
predict their involvement in seed development. Intl J Mol Sci 21:1088–1103
Ahmad MZ, A Sana, A
Jamil, JA Nasir, S Ahmed, MU Hameed, Abdullah (2019). A genome-wide approach to
the comprehensive analysis of GASA gene family in Glycine max. Plant Mol Biol
100:607‒620
Almasia NI, AA
Bazzini, HE Hopp, C Vazquez-Rovere (2008). Overexpression of snakin-1 gene
enhances resistance to Rhizoctonia solani
and Erwinia carotovora in transgenic
potato plants. Mol Plant Pathol 9:329‒338
Alonso-Ramirez A, D Rodriguez, D Reyes, JA
Jimenez, G Nicolas, M Lopez-Climent, A Gomez-Cadenas, C Nicolas (2009). Evidence for a role of gibberellins in salicylic
acid-modulated early plant responses to abiotic stress in Arabidopsis seeds. Plant
Physiol 150:1335‒1344
An B, QN Wang, XD
Zhang, B Zhang, HL Luo, CZ He (2018). Comprehensive transcriptional and
functional analyses of HbGASA genes reveal their roles in fungal pathogen
resistance in Hevea brasiliensis. Tree Genet Genomics 14; Article 41
Aubert D, M
Chevillard, AM Dorne, G Arlaud, M Herzog (1998). Expression patterns of GASA
genes in Arabidopsis thaliana: The
GASA4 gene is up-regulated by gibberellins in meristematic regions. Plant Mol Biol 36:871‒883
Bailey TL, J Johnson,
CE Grant, WS Noble (2015). The MEME Suite. Nucl
Acids Res 43:39‒49
Balaji V, CD Smart
(2012). Over-expression of snakin-2 and extensin-like protein genes restricts
pathogen invasiveness and enhances tolerance to Clavibacter michiganensis subspp. michiganensis in transgenic
tomato (Solanum lycopersicum). Trans Res 21:23‒37
Ben-Nissan G, D Weiss
(1996). The petunia homologue of tomato gast1:
Transcript accumulation coincides with gibberellin-induced corolla cell
elongation. Plant Mol Biol
32:1067‒1074
Ben-Nissan G, JY Lee,
A Borohov, D Weiss (2004). GIP, a Petunia
hybrida GA-induced cysteine-rich protein: A possible role in shoot
elongation and transition to flowering. Plant
J 37:229‒238
Berrocal-Lobo M, A
Segura, M Moreno, G Lopez, F Garcia-Olmedo, A Molina (2002). Snakin-2, an
antimicrobial peptide from potato whose gene is locally induced by wounding and
responds to pathogen infection. Plant
Physiol 128:951‒961
Betz SF (1993).
Disulfide bonds and the stability of globular proteins. Protein Sci
2:1551‒1558
Biłas R, K
Szafran, K Hnatuszko-Konka, AK Kononowicz (2016). Cis-regulatory elements used to control gene expression in plants. Plant Cell Tiss Org Cult 127:269‒287
Bjellqvist B, B
Basse, E Olsen, JE Celis (1994). Reference points for comparisons of
two-dimensional maps of proteins from different human cell types defined in a
pH scale where isoelectric points correlate with polypeptide compositions. Electrophoresis 15:529‒539
Ceserani T, A Trofka,
N Gandotra, T Nelson (2009). VH1/BRL2 receptor-like kinase interacts with
vascular-specific adaptor proteins VIT and VIK to influence leaf venation. Plant J 57:1000‒1014
Dabbert T, MAJ Gore
(2014). Challenges and perspectives on improving heat and drought stress
resilience in cotton. J Cotton Sci
18:393‒409
Darby N, TE Creighton
(1995) Disulfide bonds in protein folding and stability. In: Protein Stability and Folding, pp:219‒252. Springer,
Dordrescht, The Netherlands
El-Gebali S, J Mistry, A Bateman, SR Eddy,
A Luciani, SC Potter, M Qureshi, LJ Richardson, GA Salazar, A Smart, ELL
Sonnhammer, L Hirsh, L Paladin, D Piovesan, SCE Tosatto, RD Finn (2019). The
Pfam protein families database in 2019. The
Pfam protein families database in 2019. Nucl
Acids Res 47:427‒432
Fan S, D Zhang, L Zhang, C Gao, M Xin, MM
Tahir, Y Li, J Ma, M Han (2017).
Comprehensive analysis of GASA family members in the Malus domestica genome: Identification, characterization, and their
expressions in response to apple flower induction. BMC Genomics 18; Article 827
Fuente JIDL, I Amaya,
C Castillejo, JF Sanchez-Sevilla, MA Quesada, MA Botella, V Valpuesta (2006).
The strawberry gene FaGAST affects plant growth through inhibition of cell
elongation. J Exp Bot 57:2401‒2411
Furukawa T, N
Sakaguchi, H Shimada (2006). Two OsGASR genes, rice GAST homologue genes that
are abundant in proliferating tissues, show different expression patterns in
developing panicles. Genes Genet Syst
81:171‒180
Haruta M, G Sabat, K
Stecker, BB Minkoff, MR Sussman (2014). A peptide hormone and its receptor
protein kinase regulate plant cell expansion. Science 343:408‒411
Hassan A, M Ijaz, A
Sattar, A Sher, I Rasheed, MZ Saleem, I Hussain (2020). Abiotic Stress
Tolerance in Cotton. In: Advances in
Cotton Research. IntechOpen, London, UK
Herzog M, AM Dorne, F
Grellet (1995). GASA, a gibberellin-regulated gene family from Arabidopsis thaliana related to the
tomato GAST1 gene. Plant Mol Biol 27:743‒752
Hu B, J Jin, AY Guo, H Zhang, J Luo, G Gao
(2015). GSDS 2.0: An upgraded gene feature visualization server. Bioinformatics 31:1296‒1297
Hurst LD, EJ Williams,
C Pal (2002). Natural selection promotes the conservation of linkage of
co-expressed genes. Trends Genet
18:604‒606
Ko CB, YM Woo, DJ Lee,
MC Lee, CS Kim (2007). Enhanced tolerance to heat stress in transgenic plants
expressing the GASA4 gene. Plant Physiol
Biochem 45:722‒728
Kong H, LL Landherr,
MW Frohlich, J Leebens-Mack, H Ma, CW dePamphilis (2007). Patterns of gene
duplication in the plant SKP1 gene family in angiosperms: evidence for multiple
mechanisms of rapid gene birth. Plant J
50:873‒885
Kotilainen M, Y
Helariutta, M Mehto, E Pollanen, VA Albert, P Elomaa, TH Teeri (1999). GEG
participates in the regulation of cell and organ shape during corolla and
carpel development in Gerbera hybrida.
Plant Cell 11:1093‒1104
Kovalskaya N, RW
Hammond (2009). Expression and functional characterization of the plant
antimicrobial snakin-1 and defensin recombinant proteins. Protein Expr Purif 63:12‒17
Lescot M, P Dehais, G
Thijs, K Marchal, Y Moreau, YVD Peer, P Rouze, S Rombauts (2002). PlantCARE, a
database of plant cis-acting
regulatory elements and a portal to tools for in silico analysis of promoter
sequences. Nucl Acids Res 30:325‒327
Letunic I, T Doerks, P
Bork (2015). SMART: recent updates, new developments and status in 2015. Nucl Acids Res 43:257‒260
Li X, S Shi, Q Tao, Y
Tao, J Miao, X Peng, C Li, Z Yang, Y Zhou, G Liang (2019). OsGASR9 positively
regulates grain size and yield in rice (Oryza
sativa). Plant Sci 286:17‒27
Mao ZC, JY Zheng, YS
Wang, GH Chen, YH Yang, DX Feng, BY Xie (2011). The new CaSn gene belonging to
the snakin family induces resistance against root-knot nematode infection in
pepper. Phytoparasitica 39:151‒164
Marchler-Bauer A, MK Derbyshire,
NR Gonzales, S Lu, F Chitsaz, LY Geer, RC Geer, J He, M Gwadz, DI Hurwitz, CJ
Lanczycki, F Lu, GH Marchler, JS Song, N Thanki, Z Wang, RA Yamashita, D Zhang,
C Zheng, SH Bryant (2015). CDD: NCBI's conserved domain database. Nucl Acids Res 43:222‒226
Moyano-Canete E, ML
Bellido, N Garcia-Caparros, L Medina-Puche, F Amil-Ruiz, JA Gonzalez-Reyes, JL
Caballero, J Munoz-Blanco, R Blanco-Portales (2013). FaGAST2, a strawberry
ripening-related gene, acts together with FaGAST1 to determine cell size of the
fruit receptacle. Plant Cell Physiol
54:218‒236
Muhammad I, WQ Li, XQ
Jing, MR Zhou, A Shalmani, M Ali, XY Wei, R Sharif, WT Liu, KM Chen (2019). A
systematic in silico prediction of gibberellic acid stimulated GASA family
members: A novel small peptide contributes to floral architecture and
transcriptomic changes induced by external stimuli in rice. J Plant Physiol 234–235:117‒132
Nahirnak V, M
Rivarola, MGD Urreta, N Paniego, HE Hopp, NI Almasia, C Vazquez-Rovere (2016).
Genome-wide analysis of the Snakin/GASA gene family in Solanum tuberosum cv. Kennebec. Amer
J Potato Res 93:172‒188
Nahirnak V, NI
Almasia, PV Fernandez, HE Hopp, JM Estevez, F Carrari, C Vazquez-Rovere
(2012a). Potato Snakin-1 gene silencing affects cell division, primary
metabolism, and cell wall composition. Plant
Physiol 158:252‒263
Nahirnak V, NI
Almasia, HE Hopp, C Vazquez-Rovere (2012b). Snakin/GASA proteins: involvement
in hormone crosstalk and redox homeostasis. Plant
Signal Behav 7:1004‒1008
Oliveira-Lima M, AM
Benko-Iseppon, J Neto, S Rodriguez-Decuadro, EA Kido, S Crovella, V Pandolfi
(2017). Snakin: Structure, roles and applications of a plant antimicrobial
peptide. Curr Protein Pept Sci 18:368‒374
Peng J, L Lai, X Wang
(2010). Temporal and spatial expression analysis of PRGL in Gerbera hybrida. Mol Biol Rep 37:3311‒3317
Porto WF, OL Franco
(2013). Theoretical structural insights into the snakin/GASA family. Peptides 44:163‒167
Roxrud I, SE Lid, JC
Fletcher, ED Schmidt, HG Opsahl-Sorteberg (2007). GASA4, one of the 14-member Arabidopsis GASA family of small
polypeptides, regulates flowering and seed development. Plant Cell Physiol 48:471‒483
Rubinovich L, D Weiss
(2010). The Arabidopsis cysteine-rich
protein GASA4 promotes GA responses and exhibits redox activity in bacteria and
in planta. Plant J 64:1018‒1027
Rubinovich L, S
Ruthstein, D Weiss (2014). The Arabidopsis
Cysteine-Rich GASA5 is a redox-active metalloprotein that suppresses
Gibberellin responses. Mol Plant
7:244‒247
Segura A, M Moreno, F
Madueno, A Molina, F Garcia-Olmedo (1999). Snakin-1, a peptide from potato that
is active against plant pathogens. Mol
Plant Microb 12:16‒23
Shi L, RT Gast, M Gopalraj, NE Olszewski (1992). Characterization
of a shoot-specific, GA3- and ABA-regulated gene from tomato. Plant J 2:153‒159
Silverstein KA, WA
Moskal, HC Wu, BA Underwood, MA Graham, CD Town, KA VandenBosch (2007). Small
cysteine-rich peptides resembling antimicrobial peptides have been
under-predicted in plants. Plant J
51:262‒280
Sturn A, J
Quackenbush, Z Trajanoski (2002). Genesis: cluster analysis of microarray data.
Bioinformatics 18:207‒208
Sun S, H Wang, H Yu, C
Zhong, X Zhang, J Peng, X Wang (2013). GASA14 regulates leaf expansion and
abiotic stress resistance by modulating reactive oxygen species accumulation. J Exp Bot 64:1637‒1647
Suyama M, D Torrents,
P Bork (2006). PAL2NAL: Robust conversion of protein sequence alignments into
the corresponding codon alignments. Nucl Acids
Res 34:609‒612
Tamura K, G Stecher, D
Peterson, A Filipski, S Kumar (2013). MEGA6: Molecular Evolutionary Genetics
Analysis version 6.0. Mol Biol Evol
30:2725‒2729
Thompson JD, TJ
Gibson, DG Higgins (2002). Multiple sequence alignment using ClustalW and
ClustalX. Curr Protoc Bioinform 1:2-3
Trapalis M, SF Li, RW
Parish (2017). The Arabidopsis GASA10
gene encodes a cell wall protein strongly expressed in developing anthers and
seeds. Plant Sci 260:71‒79
Tu LL, XL Zhang, SG
Liang, DQ Liu, LF Zhu, FC Zeng, YC Nie, XP Guo, FL Deng, JF Tan, L Xu (2007). Genes
expression analyses of sea-island cotton (Gossypium
barbadense L.) during fiber development. Plant Cell Rep 26:1309‒1320
Voorrips RE (2002).
MapChart: Software for the graphical presentation of linkage maps and QTLs. J Hered 93:77‒78
Wang L, Z Wang, YY Xu,
SH Joo, SK Kim, Z Xue, ZH Xu, ZY Wang, K Chong (2009). OsGSR1 is involved in
crosstalk between gibberellins and brassinosteroids in rice. Plant J 57:498‒510
Wigoda N, G
Ben-Nissan, D Granot, A Schwartz, D Weiss (2006). The gibberellin-induced,
cysteine-rich protein GIP2 from Petunia
hybrida exhibits in planta antioxidant activity. Plant J 48:796‒805
Xu L, W Zhang, X He, M
Liu, K Zhang, M Shaban, L Sun, J Zhu, Y Luo, D Yuan, X Zhang, L Zhu (2014). Functional
characterization of cotton genes responsive to Verticillium dahliae through bioinformatics and reverse genetics
strategies. J Exp Bot 65:6679‒6692
Yang S, X Zhang, JX
Yue, D Tian, JQ Chen (2008). Recent duplications dominate NBS-encoding gene
expansion in two woody species. Mol Genet
Genom 280:187‒198
Zhang LY, XL Geng, HY
Zhang, CL Zhou, AJ Zhao, F Wang, Y Zhao, XJ Tian, ZR Hu, MM Xin, YY Yao, ZF Ni,
QX Sun, HR Peng (2017). Isolation and characterization of heat-responsive gene
TaGASR1 from wheat (Triticum aestivum
L.). J Plant Biol 60:57‒65
Zhang S, X Wang (2017). One new kind of phytohormonal signaling
integrator: Up-and-coming GASA family genes. Plant Signal Behav 12:1-7
Zhang S, X Wang
(2011). Overexpression of GASA5 increases the sensitivity of Arabidopsis to heat
stress. J Plant Physiol
168:2093‒2101
Zhang S, C Yang, J
Peng, S Sun, X Wang (2009). GASA5, a regulator of flowering time and stem
growth in Arabidopsis thaliana. Plant Mol Biol 69:745‒759
Zhu T, C Liang, Z
Meng, G Sun, Z Meng, S Guo, R Zhang (2017). CottonFGD: an integrated functional
genomics database for cotton. BMC Plant
Biol 17:101-109
Zimmermann R, H Sakai, F Hochholdinger (2010). The
Gibberellic Acid Stimulated-Like gene family in maize and its role in lateral
root development. Plant Physiol
152:356‒365